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Abstract 



^ ' We present a new method for estimating the edge of a two-dimensional bounded set, given 

a finite random set of points drawn from the interior. The estimator is based both on Haar 
series and extreme values of the point process. We give conditions for various kind of conver- 
, gence and we obtain remarkably different possible limit distributions. We propose a method 

' of reducing the negative bias, illustrated by a simulation. 
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c/3 : 1 Introduction 

Many proposals are given in the literature for estimating a bounded set S of M^, given a finite 
' random set N of points drawn from the interior. Such a diversity follows from crossing properties 
. of the observed random set N (sample, point process, random field on a grid), properties of 
CK ! the unknown bounded set 5* (convex sets, star-shaped domains, pieces of support under a given 
' curve, images), with the relevant treatments (convex hulls, extreme values, functional estimates, 
^ ■ change-point estimates, segmentation). 

Here is a Poisson point process, with observed points belonging to a subset S = {{x,y) : 
0<x<l; 0<y< f{x)}, where / is an unknown function, so that the problem reduces to 
estimating /. This problem arises for instance in econometrics where the function / is called the 
production frontier. This is the case in Hardle et al. (1995b) where data consist of pairs (Xj, 1^), 
Xi representing the input (labor, energy or capital) used to produce an output Yi in a given firm 
i. In such a framework, the value f{x) can be interpreted as the maximum level of output which 
is attainable for the level of input x. 

An early paper was written by Geffroy (1964) for samples where the estimator was a kind of 
histogram based on extreme values. More precisely, taking the equidistant partition of [0, 1] into 
kn intervals, this estimator ^„ is defined by choosing on each interval the uppest observed point. 
Some developments were achieved by Chevalier (1976), Gensbittel (1979), and Jacob (1984). Es- 
timating a smooth star-shaped contour is a closely related matter which was adressed by Jacob 
and Abbar (1989), Abbar (1990) and de Haan and Resnick (1994) with extreme values methods. 
In Deprins et al. (1984) and more recently, in Korostelev (1995), / is from economical considera- 
tions supposed to be increasing and convex which suggests an adapted estimator, called the DEA 
(Data Envelopment Analysis) estimator. Asymptotic minimax optimality of piecewise polyno- 
mials estimates is studied by Korostelev and Tsybakov (1993), Mammen and Tsybakov (1995), 
Gayraud (1997) and Hardle et al. (1995a). Let us note that a precise knowledge of the regularity 
of the function subject to estimation is a necessary condition to obtain this optimality. If it is only 
known that / is a function, this estimator does not differ from the estimator gn of Geffroy. 
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The first attempt in the direction of orthogonal series was done by Abbar and Suquet (1993), 
and more thoroughly by Jacob and Suquet (1995). The method, due to Cencov (1962) and Tiago 
de Oliveira (1963), consists of estimating the first coefficients of a truncated L^— expansion /„ of 
/. The case of the Haar basis, the general case of bases, and the example of the trigonometric 
basis were investigated, in the case of Poisson process with a known intensity c, permitting the use 
of averaging estimates of the Fourier coefficients. In the present paper, the intensity is supposed to 
be unknown, and we present extreme- valued estimates of these coefficients in the case of the Haar 
basis. Then, our estimates will appear as a linear combination of a set of A;„ extreme values, 
involving the Dirichlct kernel associated to the (hn + 1) first Haar functions. It is convenient to 
consider kn as an integer multiple of {hn + 1)- In the case (/i„ + 1) = kn, fn turns out to be simply 
the estimator gn of Geffroy. In the general case kn = dn{hn + 1), with dn oo. This is one of the 
advantages of /„ to be simple to compute. Moreover, it does not require either the knowledge of 
the intensity of the point process or information on the regularity or geometry of the function. 

Of course, it is understood in the sequel that hn ^ oo so that kn — > oo, whatever the choice 
of dn- In sections 3 and 4, we produce mild conditions for the convergence of /„, in terms of local 
error, mean integrated square error, and for the uniform norm. In subsection 4.1, the choice of 

= 1 or — 7- oo appears mainly as a conflict between the systematic part of the bias, f — fn, 
and the variance Var/„, while the statistical part of the bias, E/„ — /„, remains unaffected. 
The choice dn = 1 privileges the systematic bias, but the choice dn ^ oo improves the variance. 
In the case c?„ = 1, the limit distribution of /„ is extremal, while in the case — >■ oo, /„ is 
asymptotically normal, with a better speed of convergence. 

In fact, the most awkward part of the bias, in either method, is a negative component of the 
statistical part E/„ — /„, which is of order fc„/(nc). Recall that c is assumed to be unknown. 
In section 5, we present a very simple and efficient method of eliminating this component. The 
resulting improvement is quite evident on simulations, see Girard and Jacob (2003). What is 
better, we obtain in this way the asymptotical distribution for fn — f instead of — E fn- 



2 Preliminaries 

Let be a stationary Poisson point process on M? with an unknown intensity rate c > 0. Actually, 
suppose that we merely observe the truncated point process A^(. R S) where 

S = {ix,y)eR^\0<x<l;0<y<f{x)}, (1) 

and where / is a measurable function such that 

< m = inf / < M = sup / < +oo. (2) 

[0,1] [0,1] 



For each positive integer i, let us denote by Jj the interval Jj 



where pi and qi 



are the integers uniquely determined by i = 2'^'~^ + Pi and < pi < 2'^*"^. For f = 2*^* — 1, let 
us close Ji on the right. In what follows we suppose that hn + I = 2^", h'n G N, so that {Ji, 
+ 1 < ^ < 2/t„-|-l} appears as a partition of the unit interval into (/i^ + l) equidistant intervals. 
For each x G [0,1], denote Ji{x) the interval that contains x- Let (A;„) be a further increasing 
sequence of integers such that kn = dn{hn + 1), where dn G N, d„ > 0. Take the equidistant 
partition of the unit interval into kn intervals In,r, r = 1, - - - ,kn centered at Xr = (2r — 1)/(2A;„), 
and the associated partition of S into kn cells Dn,r with a basis In,r- 

Dn,r = {{x,y) e S \ x e In,r }■ (3) 

Then, writing A^^^ for the Lebesgue measure X{Dn,r), the piecewise constant function 

fn{x) = {hn + l) Yl Vr, X G [0, 1] (4) 



2 



appears as an approximation of /. 

Let A^*" denote the sum of independent Poisson processes with the same intensity rate c. A'^*"' 
can be considered as a Poisson process with mean measure nc\. For r = 1, . . . , let X*^ = if 
N*^{Dnjr) = 0. If N*"'{Dn^r) > 0) let X* denote the supremum of the second coordinate of the 
points of the truncated process 7V*"'(. HDn^r)- We estimate each A„_r by X* j./kn, thus obtaining 
an estimator of /„ which is 

/"(^) = ^ E Kr^ XG[0,1]. (5) 

For the sake of brevity, we write, for r = 1, . . . , kn, Yn^r = (-^n r/^n) ~ ^n,r, obtaining the reduced 
formula 

fnix)-fn{x) = {hn + l) ^ Yn,r, X e [0,1]. (6) 

In a first reading of our paper, this matter-of-fact presentation of /„ is sufficient. However, the 
deep motivation of this definition is that /„ belongs to the important family of orthogonal series 
estimators. Moreover, in most proofs, /„ is easy to handle in this setting. 

As a square integrable function on [0, 1], / admits a L^-convergent series expansion with respect 
to every orthonormal basis of L^{[0, 1]). In particular, the Haar basis is defined by 

eo = l[o,i], = 2^ - , i>l. (7) 

The Dirichlet kernel of order hn associated to the Haar basis is given by: 

Kn{x,y) = ^ei{x)ei{y) = {K + (8) 

1=0 

Now, let X^^o ^i^i be the L^-expansion of /, where Oj = ei(t)f(t)dt. The truncated expansion 
Yli=o (^i^i{x), X G [0, 1] can be rewritten in terms of the Dirichlet kernel as Kn{x, y)f{y)dy. In 
view of (4) and (8), we obtain: 

V aidix) = / Knix, y)f{y)dy = {K + 1) / f{y)dy = x G [0, 1]. (9) 

Note also that fn{x) can be written on the Haar basis as 



hn 

fn{x) = ^ai,fe„ei(x), X e [0, 1], (10) 

1=0 



where di^k^ is an estimate of the coefEcient Oj defined by 



X, 



★ 



hkn = E (^ii^r)-^, 0<i<hn. (H) 

r=l 



A straightforward calculation leads to the formula 

fn{x)=J2^n{Xr,x)^, XG[0,1], (12) 

r=l '^'^ 

which provides a general expression for orthogonal series estimates of /. Before proceeding, we 
collect some further notations to be used in the sequel: 

min f{x) = mn,r, sup f{x) = Mn,r- 

X€ln,r X&I„ 
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3 Systematic bias convergence 

According to section 2, is an estimate of the approximation /„ of the unknown function /. 
Thus, a preHminary examination of the convergence of the systematic bias /n — / is required. We 
need the modulus of continuity of /: 

a;(/,,5) = sup{|/(a; + r/)-/(a;)|; < x < x + rj < 1; 0<ri<S}. 

Then, 

|/n(x) - fix)\ < {K + 1) / - f{y)\dy < ivif, 1/{K + !))• (13) 

•Ijlix) 

In view of the last inequality, the following proposition is clear: 

Proposition 1 If f is a-Lipschitzian (0 < a < 1), then \\ fn — f |loo — ^ (^n")- 

The convergence of || fn — f II2 is easily deduced from Proposition 1 since || /n — / II2 < || /n — / |lo 

Proposition 2 If f is a-Lipschitzian (0 < a < 1), then || /« — / II2 = O {h~'^°^). 

Notice that, from (9), fn — f merely the remainder of the partial sum of order hn of the Haar 
series of /, so that /i„ = o (n) is a sufficient condition for both the L^-convergence and the uniform 
convergence when / is continuous. 



4 Consistency results 

For r = 1, . . . , kn, the truncated Poisson point processes N*'"'{. D Dn,r) are independent, and the 
extreme values X*^^, inherit this property. Thus, the distribution of the estimator /„ is uniquely 
determined by the distribution of each X*^^. We refer to the Appendix for a series of lemmas 
describing these distributions. 



4.1 Local convergence 

Theorem 1 Suppose f satisfies a Lipschitz condition of order a, (0 < a < 1). Then, under the 
condition fe„ = o (n/ In n), we have for all x G [0, 1] .• 



Efnix) - fn{x) + 

no 



(14) 



//, moreover, n = (A;^"*"") then 



Varfn{x) 



2 2 ■ 



(15) 



The condition n = (fe^"*"") is necessary to control the variance of X*^ (see Lemma 2 (61)). It 
imposes that, for all r = 1, . . . , kn, the mean number of points above mn,r in Dn,r converges to 0. 
It is then possible to ignore the contribution of the events {X* > mn,r} to the variance. 
Proof : In view of Lemma 2 (60), 



max 

l<r<fe„ 



E (X;,) - mn,r + ^ 



(16) 



Thus, since / is a-Lipschitzian, 



E/n(x)-/„(ar) + 



kn 



no 



= O 



(17) 
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Moreover, in view of Lemma 2 (61), 

Var/„(x)=Var ((/.„ + !) E ^] = ^ 11 ^^Kr^^^- (18) 



Of course, for d„ = 1, the variance is the worst since knhn/ (n^c^) ~ k'^/ (n^c^). (See the discussion 
on the choice of dn in the introduction.). 

4.2 Mean integrated square convergence 

Theorem 2 Suppose f satisfies a Lipschitz condition of order a, (0 < a < 1). Then, for the 
mean integrated square convergence of fn io f it is sufficient that kn = o(n/ Inn). 

Let us note that the condition kn = o(n/lnn) is necessary for the apphcation of Lemma 2 (59). 
Proof : Utihzing Fubini's theorem, the mean integrated square error can be written as 

J{fn) = E(/„(x) - f{x)f = E (ll U-f . (19) 

Taking account of the L^-orthogonality of fn — fn and fn — f, '^^ obtain 

J{fn) = E (11 /„ - /„ + II /„ - / 11^, (20) 



where the second term goes to zero by Proposition 2. In view of (6), the first term can be 
expanded as 

2/i„+l 

E(||/n-/n||2) =(^n + l) ^ ^ 



=hn+l 

Prom the inequality 



2- 



(21) 



E(y„,.Fn,.) < J(E(y„2 ) + E(y„2 )) < ^ax E(y„2,,), (22) 



we obtain 



E (11 fn - fn II2) < max E(y„2 ), (23) 

\ / l<.t<k„ 

and the result follows from Lemma 2 (59): 

(ll/„-AII^)=0(§)+0(>). (24) 



E 



4.3 Almost complete uniform convergence 

We shall give sufficient conditions for the convergence of the series 

+ 00 



V£>0, ^p({||/„-/||^>£}) <+oo. (25) 



n=l 



Theorem ^ If f is continuous, then under the condition kn = o(n/lnn), fn converges to f 
almost completely uniformly. 
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Proof : The condition A;„ = o(n/lnn) implies /i„ = o (n) and consequently the continuity of / 
yields the uniform convergence of /n to /. Thus, it remains to consider fn — fn, which is given 
by (6). Then, we have 



(26) 



which implies 

^'({ll/n-/n|loo>4) ^ P 



> 



2h„+l ( { ( 

s E ^' U W\>j 

i=h„+l \ {xrGJe ^ ^' 



iK + 1) 



(27) 



where d„ = kn/{hn + 1) represents the cardinality of {xr G Je}. Then, 

2h„+l 

P{{\\fn-fn\\^>e])< Y.P^{\Kr-kn\n,r\>e]). 

i=hn + l Xr&Jl 



(28) 



Since / is continuous, for n large enough Mn^r — 'n^n,r < s/2 uniformly in r, and therefore 
{\^n,r - knK,r\ > e] C [X^^^ < m„,^ - e/2} • This yields 



2/i„+l 



(29) 



In view of Lemma 1, Fn,r{^n,r ~ s/2) can be computed, leading to 



2/i„+l ^ 
^({ll/n-/n|loo>4) ^ E E«^P( 



nce\ , / nce\ 



i=hn + l Xr€Je 

which is the general term of a convergent series provided kn = o (n/lnn). 



(30) 



4.4 Asymptotic distribution 

It is worth noticing that in convergence Theorems 1, 2, and 3, conditions kn = o (n/lnn) and 
n = o {k^'^'^) do not imply that kn should go to infinity faster than hn- On the contrary, the 
asymptotic distribution of /„ depends strongly on the relative rates of hn and kn- We present 
two entirely different cases. In the first one ((i,„ = 1), the partition {Ji, hn 1 < i < 2/i,„ + 1}, 
is arranged to be exactly the same as the partition {In,r, ^ ^ r < kn}, so that the estimator /„ 
merely reduces to gn, the Geffroy's estimate, a kind of histogram based on the uppermost values 
of the point process on each In,r- Such a case has already been extensively studied since the 
generating paper of Geffroy (1964) and gives extremal asymptotic distributions. The second case 
(dn oo) is more in the spirit of the orthogonal series method, with a condition /i„ = o (fc„) 
suggesting an estimation of the coordinates = ei{t)f{t)dt by the random Riemann sums 
Si.fen) see (11). Due to the underlying properties of the Poisson process TV*", these Oj^fc^ are sums 
of independent terms, so that a Gaussian asymptotic distribution is expected. However, in both 
theorems, a closer control of fc„ is required, which is essentially of the same kind as n = o (fe^"*"") . 



6 



Theorem 4 Let x G [0,1] and suppose f is a-Lipschitzian (0 < a < 1). Suppose (i„ = 1 and 
n = o(A:i+"). 

(i) If kn = o{n), then Tn{x) = -^{fnix) — knXn,r) converges in distribution to the Weibull 
Extremal Value Distribution. 

(a) Suppose knlnkji = o{n) and denote Zn = max sup \fnix) — Mnr\, then T^Zn — Infcn 

l<r<kn x&In,r 

converges in distribution to the Gumbel Extremal Value Distribution. 

Let us note that in (ii), condition A;„lnA;„ = o(n) is just a weaker version of the condition 
kn = o(n/lnn). Proof : 

(i) Denote by Gn,x the distribution function of Tn{x), and consider r such that x G In,r- Then 
for all uEM, 

Gn,x{v) = P (^|/n(a;) < knXn,r + ukn/ncYj = Fn,riknK,r + ukn/uc), (31) 

since reduces to X* ,, on In,r when dn = 1. Suppose for instance that u < 0. Then 

kn = o {n) implies nX^^r ~^ and thus for n large enough, 



u G 



-ncXn,r,nC I -r^ - Xn,r 



Mn 
kn 



Lemma 1 yields Gn,x{u) = exp (u — nc^n,r{u)) with 

Cn,r{lJ') — ^n,r{knXn,r + ukn/nc) — Xn^r + ^n,r/kni (^2) 

and < nc£^n,r{'u) < nc{Mn^r — mn^r)/kn- Then ^n,r — >■ as n — >■ cx) under condition 
n = a As a conclusion, Gn,x{u) — exp(u) if u < 0. The case u > is similar, the 

result being Gn,x{u) 1. 

(ii) When d„ = 1, fn{x) reduces to X*^. on In,r, and thus Zn = max {Mn,r — ^nr)- 

' ' l<r<kn ' ' 

stand for the distribution function of — lnA;„. Consider u G M. For n large enough 



u G 



1 ; '^^ ( knlnkn 
In kn, -j- Mn,r 

kn \ no 



Vr G {1, . . . , kn} and by a straightforward computation 



Gn{u) = J] Fn,r ( M,,, - ^{u + lukn)) = - e"''"" V ) ' (^3) 



with nc{Xn,r — Mn,r/kn) < iln,r < 0. Now, TT j 1 — j — > , rjn,r is a null array in 

r=l \ '^n J 

view of n = o (/c^"*""), and Lemma 4 (see Appendix) gives the result. ■ 
Now, let X G [0, 1] be fixed. Denote (T„ = kn/{ncdl/^). 

Theorem 5 If f is a-Lipschitzian (0 < a < 1), under the three conditions hn = o{kn), kn = 
a (n/ Inn) and n = o (^kl/^'^'^hl/^^ , Vn{x) = CTn^{fn{x) — E{fn{x))) converges in distribution to a 
standard Gaussian variable. 
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Proof : Let us denote by V'n,x the characteristic function of Vn{x). It expands as 



k rl ' 



X E 



h rl ' 

Knan XreMx) 

Introducing a^^r = "^n,r ~ kn/nc, (34) can be rewritten as 



(34) 



i^n,xit) = expit I (a„,^ - E(X* ,.)) 



X E 



Xr&Je{x 

nc 



, ,1/2 



(35) 



In view of Lemma 2 (60), the first term in the product converges to 1 under conditions hn = o {kn), 
kn = o(n/lnn) and n = o (jtu^^^^hj/^^ ■ After simpUfication, and in view of the independence 
of the X* 

V'n,x(t) ~ e^*'^" ' n ^ expiJ^^(X*,-m„,,) ) . (36) 

XreJf(x) L \kndn j 

Introducing the characteristic function of X* — mn,ri it follows that 



(37) 



— 1/2 

where = tncdn /kn- Before applying Lemma 3, it remains to verify that i„ = o{n/kn) and 
tn = o (4+2«/ra) . Clearly, 



nc 



since hn = o (kn), and 



n 



1+a 



= o{l), 



since hn = o {kn) and n = o (/c^^"). The characteristic function is of the order: 
^n,a.(t) ~ exp (itdl/^) (1 + 

X n ( 1 + itd-^''^Fn,Mn,r) + O ( 71^2^^) + 



(38) 
(39) 

(40) 
(41) 



By a third-order Taylor expansion, one sees that (40) converges to e as d„ — )■ +oo. Finally, 
consider the logarithm of (41): 



Wn= Yj 1^(1 + ^n,r)^ with Un,r = M^^''^ Fn,r{mn,r) + O i \tn\ ) + O (n . (42) 



8 



Observe that maxi<r<fe^ h^n.rl converges to with (38) and (39). Thus, for n large enough 
\un,r\ < 1/2 uniformly in r and the classical inequality |ln(l + ■Un,r)| < 2 \un,r\ yields 

\Wn\ < 2 |i| dl/''Fn,r{mn,r) + O dn^^^ + O (ci„n-^) . (43) 

Using Lemma 1, we have Fn^r{fnn,r) = O (ra/fc^"'"") leading to 

Therefore, choosing s = 2 entails W„ — )■ and ipn,x{t) e~* as n — >■ cxd. ■ 



5 Bias reduction 

Theorem 5 is not convenient for providing confidence intervals for f{x), since it only gives the 
asymptotic distribution of a~^{fn{x) — E(/„(x))). A quick calculation shows that even in the 
case f{x) = a > 0, there is no hope of a similar result for a~^{fn{x) — a). Indeed, 

E(a-X;,J^^(l-exp(-=))>i^ (45) 

for n large enough. Thus, for every x G [0, 1], under the hypotheses of Theorem 5: 

a-\a-E{Ux))) = a-'l- ^ (a - E(X;,)) > i^^^ = irfV^ ^ oo. (46) 

xreJe{x) 

Define fn{x) = fn{x) + kn/{nc). Since c is not supposed to be known, /„ is of no use in practice. 
Nevertheless, it is a first theoretical step to introduce an estimator with a smaller bias and to 
study its asymptotic distribution (see Theorem 7). 

Theorem 6 If f is a-Lipschitzian (0 < a < 1), under the three conditions hn = o{kn), kn = 
o{n/\nn), and n = o , c7^^(/„(x) — f{x)) converges in distribution to a standard 

Gaussian variable. 

Proof : The proof is based on the expansion 

^nHfn -f)= a-\fn - E{U)) + a-\E{U) - fn)+0-\fn " /)• (47) 

Theorem 5 shows that the first term converges in distribution to a standard Gaussian variable. 
Now, from Lemma 2 (60), 



max 

l<r<fc„ 



E [XI,) + ^ - knXn, 

' nc 



O(t^), (48) 



and we obtain. 



(E(/n(x) - fnix)) = a-HK + 1) E { + ^^- Kr \=0 \ ^ , ^ ) • (49) 



Xr&Jl{x) 

Finally, let us consider the third term. In view of Proposition 1, 

ncdn"^ ^ /, ^ ( n 



a-' \Ux) - fix)\ = — (/^r) = O ^ 0. (50) 
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We are now ready to introduce a very simple ad hoc estimate of kn/{nc): 



(51) 



and Zt ^ is the infimum of the second 



where Z* ,. is defined by Z* ,. = if iV*"(L>^ 
coordinates of the points of the truncated process A/"*"(. n -Dn,r) if N*"'{Dn^r) > 0. Define 



fn{x) = Y^Kn{Xr,x) ( 

r=l ^ '^'^ 



(52) 



The moments of Z*^^ can be expanded similarly to that of X* ,^, leading to 
E(Z„) = — + o (n-') , Vs > and Var(Z„) ~ 



(53) 



We now prove the following theorem which states that /„ inherits its asymptotic behavior from 

fn- 

Theorem 7 Under conditions of Theorem 6, a~^{fn{x) — f{x)) converges in distribution to a 
standard Gaussian random variable. 

Proof : In view of Theorem 6, it suffices to verify that a~^{fn{x) — fn{x)) converges to in 
probability. We shall show that for all £ > 0, 



P \fnix) - fnix)\ >^})<P 



Zn 



k 



nc 



> e 



converges to zero as n goes to infinity. To this end, consider the expansion 





nc 



nc 



(54) 



(55) 



In view of (53), the second term is bounded from above by e/2 for n large enough, and therefore 
p({a-i|A(x)-/„(x)| >£}) < P({c7-i|Z„-E(Z„)|>£/2}) (56) 



< ^Var(c7-iZ„) = 0(l//i„). 



(57) 



with (53). ■ 

For instance, in the case where / is continuously differentiable (a = 1), one can choose hn = n}^'^ 
and kn = n^/^(lnn)'^ for all e > arbitrarily small. Theorem 7 shows that fn{x) converges to 
f{x) with the rate n^^/^(lnn)^ for all x € [0, 1]. 



6 Conclusion and further developments 

In this paper, the underlying Poisson point process is supposed to be homogeneous. It is easily 
seen that all convergence theorems of section 4 still hold if the intensity is an arbitary function 
on 5*, bounded from below by a constant c > 0. Such an extension for limit distributions is not 
straightforward. We published a companion paper devoted to C^-basis estimate. This study, 
based on formula (12), can be found in Girard and Jacob (2003b). This case involves rather 
different difficulties due to the Dirichlet kernel feature. However, this provides smooth estimators 
more adapted to regular boundaries. 
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Appendix: Auxilary results 



In the following lemma we give the distribution function of X*^ < r < kn), with an accurate 

approximation determined by the bounds run^r and Mn,r of / on In^r (the notations are those 
introduced in section 2). The proof is straightforward after noticing that, for every measurable 
set B, P(iV™(5) = 0) = exp (-ncA(5)). 

Lemma 1 The distribution function of X* ^ (\ <r < kn) is given by 



ifu<0, 

exp ^{u- Mn,r) - ncen,r{u) ifO<U< Mn,r, 

1 if Mn,r < U, 



(58) 



with en,r{u) = \n,r - , if < U < mn,r, and \n,r " < £n,r{u) < 0, if mn,r <U< Mn,r- 

An approximation of the first moments can be derived from the distribution function given in 
Lemma 1: 



Lemma 2 Suppose f is a-Lipschitzian (0 < a < 1) and k^ = o(n/lnn). Then, 



max 

l<r<fc, 



(59) 



and 



maxJ^(x;,)-a„,|=o(^), 



where an,r = 'mn,r ~ V moreover n = o (A:^"*""), 



(60) 



Var{Xl,) 



l<r<kn. 



(61) 



Proof : 

• Observe that 



E 



{Xl,-knXn,r) = A;Xr^(^n,r = 0) + / (u - knXn,r f F;,^,{u)du 





+ / {u-knXn,rfFn,r{du). (62) 



The distribution function F^^j- is given in Lemma 1 using £n,r-i a non-decreasing function on 
[0, Mn^r] ■ Then, the derivative ^ exists almost everywhere and is non-negative. We thus 
obtain 



TIC 



a. e. 



(63) 



with equality for < u < mn,r- The following bound may be found quite easily: 



E 



2fc2 



n^c^ 



(64) 



for r = 1, . . . , and (59) is proved. 
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• Prom Lemma 1, 



l"mn,r 


1 — exp ^ 


Jo 





nc 



/ 

•Jmn,r 



du+ Fn,r{u)du, 



where the last integral is positive and less than (M„^r — 'mn,r)- Therefore, for r = 1, . . . , 



mn,r + ^e-"^^"--- (l - e""^ ) < E(X*^) < M„,^ + ^e""'^^"''" (l - e"^^) , (65) 

TtC V / The V / 



and consequently 



E (X*^) - an,r \ < iMn,r - m„,^) + — 1 + e-"^^"'"^ - e-'*^(^".'-"*".'-/'=") . (66) 



nc 



The first term of the sum is obviously O (1/k!^), uniformly in r. Furthermore 



1 + e 



~7lcXnjr 



< e 



n 



+ 



nc{Mn,r - mn,r 



uniformly in r and (60) is proved. 
• (61) is established with similar techniques. ■ 

The expression of the characteristic function can also be deduced from the distribution function 
given in Lemma 1: 

Lemma 3 Suppose f is a-Lipschitzian (0 < a < 1) and kn = o{n/lnn). Let (tn) G M 6e a 
sequence such that tn = o{n/kn) and tn = o {kl^'^°' / . Then, the characteristic function of 
(X*^ — run^r) can he written at point tn as: 



*n,r-(in) = 



1 + itn^Fn,rimn,r) + O ^TT5^) + O (n"^) 



1 + itn — 

' nr 



with s arbitrary large, and Fn^r = 1 ~ Pn,r < f < kn)- 
Proof : Consider the expansion 



E(e'*"^n.-) = P(X*^ = 0) + / e'''*-F^^^{x)dx + / e^^*"F„,^(dx). 

JO J m„ r 



(68) 



The first and second term can be computed explicitely since Lemma 1 provides an expression of 

Fn,r on [0,mn,r]- 



E(e^*"^"-) = e-"^^"- + r + / e'-'-Fn,r{dx). (69) 



1 + itn 



The third term expands as 



/ e''''"Fn,r{dx) = e''""'->'^FnA^n,r)+ (e^^*" - e^*"™"-'^)F„,,(dx), (70) 
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with |e*^*" - e'*"'""'''! < {M^^r — mn,r) |*n|- Thus 



J mn,r 



nc , 



n 



O { \tnl i+2a 



(71) 



Now, take the common denominator in (69) and replace (70) and (71) in the numerator. Notice 
that kn = o{n/lnn) yields e""'^^"''' = o(n~*), Vs > and multiply everything by e"'*"'""-'' to 
conclude the proof. ■ 

In order to be more self-contained, we quote here Lemma 4.1 of Jacob and Abbar (1989) which is 
used to prove Theorem 4(ii). Let us recall that a triangular array {r)n,r, r = 1,. . . , kn} is called 
a null array if max |r/n,r| — >^ as n ^ oo. 

Lemma 4 Let {xn,r-, r = 1, . . . , A;„} he a positive triangular array such that {zn,r} = {e^^"-''} 
is a null array. If JJ(1 — -Zn,r) has a limit £ > 0, then JJ(1 — Zn^r^^"'"') has the same limit £, 



r=l 



r=l 



whenever {rin,r} is a null array. 
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